A Metaheuristic Technique for Cluster-Based Feature Selection of DNA Methylation Data for Cancer
نویسندگان
چکیده
Epigenetics is the study of phenotypic variations that do not alter DNA sequences. Cancer epigenetics has grown rapidly over past few years as epigenetic alterations exist in all human cancers. One these methylation; an process regulates gene expression and often occurs at tumor suppressor loci cancer. Therefore, studying this methylation may shed light on different functions cannot otherwise be interpreted using changes occur Currently, microarray technologies; such Illumina Infinium BeadChip assays; are used to extremely large number varying loci. At each site, a beta value (β) reflect intensity. clustering data from various types cancers lead discovery partitions can help objectively classify well identify relevant without user bias. This proposed Nested Big Data Clustering Genetic Algorithm (NBDC-GA); novel evolutionary metaheuristic technique perform cluster-based feature selection based sites. The efficacy NBDC-GA was tested real-world sets retrieved Genome Atlas (TCGA); cancer genomics program created by National Institute (NCI) Human Research Institute. performance then compared with recently developed Immuno-Genetic (IGA) same sets. outperformed IGA terms convergence performance. Furthermore, produced more robust configuration while simultaneously decreasing dimensionality features maximum 67% 94.5% for individual type collective cancer, respectively. also able two chromosomes highly contrasting methylations activities were previously linked
منابع مشابه
Feature selection for DNA methylation based cancer classification
Molecular portraits, such as mRNA expression or DNA methylation patterns, have been shown to be strongly correlated with phenotypical parameters. These molecular patterns can be revealed routinely on a genomic scale. However, class prediction based on these patterns is an under-determined problem, due to the extreme high dimensionality of the data compared to the usually small number of availab...
متن کاملCluster-based pattern discrimination: A novel technique for feature selection
The study of feature selection methods has become an area of intensive research in pattern recognition. In this paper, a new feature selection approach, called cluster-based pattern discrimination (CPD), is introduced. Classes are independently partitioned into clusters to group together similar patterns: a different subspace is defined for each cluster by determining an optimal subset of featu...
متن کاملFeature Selection and Extraction Framework for DNA Methylation in Cancer
Feature selection methods for cancer classification are aimed to overcome the high dimensionality of the biomedical data which is a challenging task. Most of the feature selection methods based on DNA methylation are time consuming during testing phase to identify the best pertinent features subset that are relevant to accurate prediction. However, the hybridization between feature selection an...
متن کاملA New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum
Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such ...
متن کاملa new approach to credibility premium for zero-inflated poisson models for panel data
هدف اصلی از این تحقیق به دست آوردن و مقایسه حق بیمه باورمندی در مدل های شمارشی گزارش نشده برای داده های طولی می باشد. در این تحقیق حق بیمه های پبش گویی بر اساس توابع ضرر مربع خطا و نمایی محاسبه شده و با هم مقایسه می شود. تمایل به گرفتن پاداش و جایزه یکی از دلایل مهم برای گزارش ندادن تصادفات می باشد و افراد برای استفاده از تخفیف اغلب از گزارش تصادفات با هزینه پائین خودداری می کنند، در این تحقیق ...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computers, materials & continua
سال: 2023
ISSN: ['1546-2218', '1546-2226']
DOI: https://doi.org/10.32604/cmc.2023.033632